Graphix-T5: Mixing Pre-trained Transformers with Graph-Aware Layers for Text-to-SQL Parsing

نویسندگان

چکیده

The task of text-to-SQL parsing, which aims at converting natural language questions into executable SQL queries, has garnered increasing attention in recent years. One the major challenges parsing is domain generalization, i.e., how to generalize well unseen databases. Recently, pre-trained text-to-text transformer model, namely T5, though not specialized for achieved state-of-the-art performance on standard benchmarks targeting generalization. In this work, we explore ways further augment T5 model with components parsing. Such are expected introduce structural inductive bias parsers thus improving model’s capacity (potentially multi-hop) reasoning, critical generating structure-rich SQLs. To end, propose a new architecture GRAPHIX-T5, mixed augmented by specially-designed graph-aware layers. Extensive experiments and analysis demonstrate effectiveness GRAPHIX-T5 across four benchmarks: SPIDER, SYN, REALISTIC DK. surpasses all other T5-based significant margin, achieving performance. Notably, GRAPHIX-T5-large reaches superior original T5-large 5.7% exact match (EM) accuracy 6.6% execution (EX). This even outperforms T5-3B 1.2% EM 1.5% EX

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Content-Aware Collaborative Music Recommendation Using Pre-trained Neural Networks

Although content is fundamental to our music listening preferences, the leading performance in music recommendation is achieved by collaborative-filtering-based methods which exploit the similarity patterns in user’s listening history rather than the audio content of songs. Meanwhile, collaborative filtering has the well-known “cold-start” problem, i.e., it is unable to work with new songs that...

متن کامل

Something Old, Something New - Applying a Pre-trained Parsing Model to Clinical Swedish

Information access from clinical text is a research area which has gained a large amount of interest in recent years. Automatic syntactic analysis for the creation of deeper language models is potentially very useful for such methods. However, syntactic parsers that are tailored to accommodate for the distinctive properties of clinical language are rare and costly to build. We present an initia...

متن کامل

Graph parsing with s-graph grammars

A key problem in semantic parsing with graph-based semantic representations is graph parsing, i.e. computing all possible analyses of a given graph according to a grammar. This problem arises in training synchronous string-to-graph grammars, and when generating strings from them. We present two algorithms for graph parsing (bottom-up and top-down) with s-graph grammars. On the related problem o...

متن کامل

Relational Database with Sql and Graph Database

This paper represents the study of various theory based on database model. In this paper a study of various papers is done, and in the reviewed paper graph database and SQL is done. This huge repository of unstructured data has resulted in making the data search and knowledge extraction, a very cumbersome task if one continues using the legacy relational database.

متن کامل

SnipSuggest: Context-Aware Autocompletion for SQL

In this paper, we present SnipSuggest, a system that provides onthe-go, context-aware assistance in the SQL composition process. SnipSuggest aims to help the increasing population of non-expert database users, who need to perform complex analysis on their large-scale datasets, but have difficulty writing SQL queries. As a user types a query, SnipSuggest recommends possible additions to various ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i11.26536